Introduction

Column

Rapping through the Decades: A Computational Analysis of 90s Hip-Hop and Modern Rap Music

Hip-hop music has been a major cultural force for several decades, with its roots in the African American and Latino communities of the Bronx in the 1970s. Over time, the genre has evolved and branched out into various sub-genres, including gangsta rap, conscious hip hop, and trap music, to name a few. In this course I’ve been doing research on the difference and similarities between popular ‘old school hip-hop’ from the 90s and popular modern day rap from around 2017 to 2023. Rap is now considered as one of the most popular genres in contemporary music. I’m interested in how the music has changed and if modern day rap has significant differences compared to 90s hip-hop that can be confirmed by using computational musicology; but I’m also interested in possible similarities. Therefore my corpus will contain a merged group of tracks from both these periods. In the presented analysis of this research I will refer to the popular hip-hop music of the 90s as ‘old-school’ and the popular rap music from around 2017 to 2023 as ‘modern’.

As I said, the natural groups and comparison points are popular tracks from both these time spans. One of my hypotheses is that I expect to see an increase in aspects like loudness because of the way music and hip hop beats get produced these days; but I am very unsure how other factors like energy, valence and danceability have changed between these two groups. Besides that I also assume that modern rap will feature more electronic and digital production elements than 90s hip-hop. I hope to determine these differences by looking at various aspects, such as timbre.

One strength of my corpus is that I respectively checked, with the help of popularity charts and the stream counts on Spotify, which tracks were the most popular in these time spans in both the 90s and around 2017 to 2023 and used these as a reference point. Hereby I ruled out a certain form of subjectivity and my own taste of tracks because I only focused on tracks that are popular and have a lot of streams. So despite that these two time spans differ a lot and the genre of hip hop/rap has drastically evolved; my comparison groups have one thing in common, they both consist of only really popular tracks. This also provides some interesting research questions: On the one hand, can we determine a notion of unity in both time spans that only consists of popular tracks, or is there a lot of fluctuation? And on the other hand, can we identify common characteristics for the whole corpus?

The SpotifyAPI track features provide a lot of information about how the tracks are classified and what different musical characteristics and qualities they convey. These track features therefore offer a reliable foundation to start off this research. You can find the playlist of the corpus here.

Column

PostMalone and Jay-Z

This is an AI generated image, made on Craiyon, of two very well known artists: on the left we see Post Malone and on the right Jay-Z. A fun blend of two iconic figures from different eras, with Jay-Z representing the 90s hiphop sound and Post Malone embodying the modern day rap style.

Valence and Energy

Column

Column

Is there a correlation between Valence and Energy?

To begin the visualisation of the selected corpus let’s take a look at two variables called energy and valence. How do they differ from the 90s to the modern day rap music and is there a correlation between the two variables? The measurement for both variables ranges from 0.0 to 1.0. Valence is referring to the emotional quality the music conveys: 1 being very positive, happy and/or uplifting and 0.0 being angry, regretful or sad. Energy gives us an idea about the intensity and the activity of the track. A really energetic track would feel rousing for example. We can see that there is more fluctuation in the modern rap music when we look at valence, the old-school graph has more tracks with higher valence compared to the later period, consisting of predominantly values above 0.5. Overall we see that the most popular hip-hop/rap songs from both time spans generally have high energy values, most of them being around 0.5 or higher. It is difficult to determine a clear correlation between energy and valence in these popular hip-hop/rap songs from both periods: because when we look at the old-school graph the highest values of energy are both located at the lowest and highest values of valence. One possible explanation for this could be that either low or high valence results in high energy. It’s important to mention that there are exceptions to this though, such as “Check the Rhime” from A Tribe Called Quest released in 1991 (the blue dot in the upper left corner of the “old-school” graph). There is one significant difference we can conclude between the old-school and modern graph: the modern period’s highest energy values do not correlate with either high or low valence values, where the old-school period has the highest energy on the extreme values of valence, both high and low. You can hoover over the dots of the scatter plot to see the track names, the exact release date and the exact values for both variables; if you are familiar with 90’s hip-hop or popular rap songs from recent years, you will probably recognize some of the track names!

Danceability and Speechiness

Column

Column

Column

Is there a correlation between Speechiness and Danceability?

These graphs examine the relationship between speechiness and danceability from both time periods (you can highlight one of the two periods by double-clicking on it in the legend). Danceability measures how suitable a track is for dancing, it’s primarly based on the rhythm, tempo and beat strength. It ranges from 0.0 to 1.0, with higher values indicating a greater likelihood of the track being danceable. Speechiness, on the other hand, measures the presence of spoken words in a track. It also ranges from 0.0 to 1.0, with higher values indicating that the track contains more ‘spoken word-like’ vocals. Without going in too much detail here, we are looking at two graphs which contain exactly the same data; the only differences are the trend lines, these are calculated by different methods called LOESS and GAM. One interesting and very extreme outlier here is “Yes Indeed” from Lil Baby featuring Drake; having both the highest danceability (almost at a maximum of 1.0) and the highest speechiness of the whole corpus. Because of this I’ve used two different methods for determining the trend lines. For the old-school period, both methods showed a relatively stable relationship between speechiness and danceability, with a upward trend at the beginning, peaking in between the speechiness values of 0.2 and 0.3. These findings suggest a significant trend in the data during the old-school period, where an increase in speechiness is associated with an initial increase in danceability, followed by a subsequent decrease. The cluster of data points (between the 0.2-0.3 range of speechiness) in the old-school period also nicely align with the peaks of both lines. The modern period also aligns and crosses with the old-school trend line around the spot of 0.23 speechiness, using the LOESS method. However, looking at this trend line of the modern period, we can see that the outlier “Yes Indeed” has a significant impact on the observed trend regarding the LOESS method. The GAM method, on the other hand, shows a horizontal line; which would indicate that the increase of speechiness doesn’t influence the danceability of the modern period in a significant way, whilst the old-school period tends to show a significant non-linear relationship using both methods. All in all we can conclude that there would be more analysis needed to confirm these assumptions but that the results are remarkable: Yes Indeed” has a very decisive impact on the modern period, but it’s also an important point of data which we can’t ignore. Another relevant general observation regarding both groups, is that the popular rap music significantly consists of more tracks which have less speechiness compared to the 90s popular hip-hop and that these tracks tend to show more fluctuation regarding the danceability. The graphs also confirms one observation made earlier and this actually is very important to emphasize and remember that what we are looking at is relative: the dataset exhibits a significant positive correlation between danceability and track popularity, with only a few tracks (three out of the whole corpus) having a score below 0.5 (one even being 0.497, which is extremely close to this threshold).

Comparing the Spotify Timbre Coefficients of 90s Hip-Hop and Modern Rap

Column

Column

What is timbre and what are timbre coefficients?

Timbre is everything about an audio file that is separated from the sound qualities pitch, duration and volume. It can be seen as a tone color or tone quality. For example, think about a violin and a piano that would both play a musical tone like C4: this tone could have the same pitch, same duration and the same volume but will differ in timbre. Timbre is a comprehensive concept that is sometimes difficult to put into words and can therefore be difficult to analyse. This graph contains the 12 timbre coefficients that Spotify uses to analyse an audio file and this is pretty ambiguous to interprete; the only people who have assigned a real meaning to them are Spotify engineers, and they’ve kept information about the exact meaning of these values internal. When we take a quick look, there seems to be a lot of similarity between the two, on the other hand, when we look more detailed, there are significant differences in shape and range in certain coefficients, for example at c02, c05, c07, c08 and c11, this confirms that the timbre of both groups differ. The first coefficient actually is mainly based on the loudness variable; so besides the fact volume usually isn’t a factor in timbre, Spotify uses loudness as one of their timbre coefficients. Try to take a good look at c01 before going to the next page, or feel free to come back to this page after visiting the next, although small, this graph has visualized the specific values of the loudness variable of this corpus very accurately and it is fun to see how you can recognize the shape of the boxplots I will present on the next page into these shapes!

Looking Further Into the Loudness Variable

Column

Column

Conclusions and Assumptions Based on the Boxplots

One of my hypotheses was that modern rap music would have higher loudness than 90s hiphop music. This boxplot shows that the modern rap music has a higher median, but that the 90’s actually has the highest max loudness value of the whole corpus! The range of loudness for 90s hip hop music (from -14.73 to -2.43) is way wider than that of modern rap music (from -9.31 to -3.37, when we leave out the outlier), this indicates that there is greater variability in loudness within 90’s hip hop music; the interquartile range from both boxplots are a good visual representation for this. Overall, the boxplots suggest that modern rap music tends to be louder than 90’s hip hop music on average; because it has a more concentrated distribution of loudness values and has a higher median whilst the IQR range is way smaller. Nevertheless, it is noteworthy to mention that the third quartile of the 90’s hip hop has a higher range than the modern rap music and, like I mentioned earlier, the loudest track of this corpus is from the 90’s. However, the possibility of these tracks being remastered or other factors that may have affected the loudness of the music cannot be ruled out based on the boxplots alone and this could be a really decisive factor on why the ‘old-school boxplot’ has the highest max loudness value. This corpus only consists of the most popular tracks and therefore it makes sense to assume these tracks have been remastered before they where uploaded on Spotify.

Comparing the Mean Tempo (BPM) of Both Periods

Column

Column

Conclusions and Assumptions Based on the Mean Tempo and the Standard Deviation

With this graph we are looking at the mean tempi of the all the tracks. The y-axis of the graph displays the standard deviation, which is a measure of the degree of variability of the data points from the mean value. In addition to this, the graph also includes information on two other variables: duration and volume. One first observation when looking at this graph is that the standard deviation overall is very low; we can conclude that there is not a lot of variance in the calculated means of the tempi. Something we would expect with hip-hop and rap music in general, and therefore also from both these periods, as the beats and rhythm usually are very repetitive and steady. It is a logical conclusion that this also explains why this type of music generally has such high danceability values, as we saw earlier. Besides this we can see a clear difference between the hip-hop from the 90’s and the modern rap music. As the old-school period (apart from the 3 outliers on the right) has significantly lower mean tempo than the modern rap music. One interesting observation, something we saw earlier in the valence and energy graph too, is that the modern rap music shows a lot of fluctuation compared to the old-school period. We can see a steady cluster of data around 140/150 BPM but also a lot of other different values of BPM, whilst the data from the old-school period has a really steady cluster of almost all the data (besides the 3 outliers with high BPM) from around 80 to roughly 115 BPM. This fluctuation can also be observed in the duration variable. With the modern tracks displaying a wider range of sizes, including very small ones (which means these songs are very); this stands in contrast to the old-school tracks, which tend to have a more uniform size. This observed variability in track duration, with a trend towards shorter tracks in the modern period, may be attributed to a shift in the music industry towards optimizing streaming revenue, as shorter tracks tend to have a higher repeat value which generates more streams and therefore higher profits.

One of the most streamed songs on Spotify

Looking at one of the outliers: “rockstar”

In the graph about energy and valence there were some remarkable outliers, one of them being the song “rockstar” from Post Malone featuring 21Savage which consists of the lowest valence of the whole corpus (0.129). This song is also the most streamed song of the corpus having above 2.7 billion streams and also holds the fifth spot on Spotify’s list of most-streamed tracks! Let’s take a look at the chroma of this song in the forms of a chromagram and a chordogram. A chromagram is used to identify the pitches that are played in a song. It contains all the 12 pitch classes (tone height is a irrelevant factor for chroma) from the Western tonal system. Each chroma vector shows the distribution of energy across the twelve chroma bands in a signal’s frame. Herefore it is a very usable tool to represent and analyse an audio-file, as it gives us a lot of insight about the harmonic and melodic movement of the track. We can also use these chroma vectors to make other grams, such as a chordogram; this also uses chroma, but with some computational magic, uses the pitches from the song to estimate played chords. The chroma from this song is actually really interesting as it conveys some musical, theoretical contradictions that are counterintuitive on first sight. When looking at the chromagram we see two dominant notes besides the C in the intro; the notes C#/Db and G are repeated regularly through out the song and could be seen as the most abundant notes of the chromagram. Besides these two notes we also see some repetitive magnitude in other notes, which at times even show some chromaticism. These first observations already can be seen as an explanation why the song has such low valence. An interesting nuance to this whole story thus far is that the SpotifyAPI concludes that this song is in F minor (key=5 and mode=0) while we can see that the note F throughout the song actually is one of the few notes that has very little magnitude apart from the ending. One logical explanation for a computational algorithm to conclude this would be that both the C#/Db key or G key don’t share the two notes (C#/Db and G), either in minor or major modes and therefore both are ruled out as one of the keys via this algorithm. I actually have my own theory about the key and I don’t agree that this song would be in F minor. I’m really positive that this song is in the key G minor, due to both my personal, musical intuition and logical reasoning. With this in mind let’s take a look at the chordogram, and I will elaborate on how I came to this reasoning.

Looking At the Chordogram

We can see that the short intro consists of a C minor chord, which plays the notes G, Db and C. (which also perfectly fit into G minor). After this short intro, throughout the whole song, A7 (A – C# – E – G) and Eb7 (Eb - G - Bb - Db) are predominantly present in the chordogram. When listening to “rockstar”, we can hear synthpads throughout the whole song, laying the foundation of the harmony. These are getting steadily repeated, which is nicely shown by the chordogram. I’ve played along with the really simplistic main melody of this song and I’m positive that this melody is in the key of G minor and goes like this: D - C - Bb - A, and eventually this melody expands a little and the A resolves to the tonic G. My assumption on why the chordogram provides this information is that the synthpads are detuned. And it is hard to layer out these chroma bands, as these synth pads are present throughout the whole song. This also explains why the chromagram we saw showed some remarkable chromatic energy distribution, which isn’t normally used in pop or modern day rap songs. Besides this detuning, the chordogram determined two chords in which both the note G is used, but when looking into both these chords we see chromatic notes (A-Bb and Eb-E); which is contradicting because both these chords are presented as one straight dark line throughout the whole song and chordogram, which would mean there would be a constant chromatic sound of these two groups of notes A-Bb and Eb-E, which as I said I think is product of detuned synths. The chords share the G, which I think is the tonic, and share a C#/Db (which I think is also due this detuning). Well, it is a very interesting observation and one thing I can recommend is that you decide for yourself with keeping this in mind! If you would like to find out, play along with the song, and I’m sure you will agree with one thing, that this song isn’t in F minor.

Old school hip hop song analysis

Column

Self-Similarity Matrices for “Juicy”

Let’s now have a look at a track from the 90s. One thing that immediately stands out is that the graph contains a lot of unity in terms of chroma. The timbre-based self-similarity matrix shows us the clear structure of the song. Roughly speaking we can see 3 big sections, which are the rapverses of Biggie Smalls and in contrast to these verses we see 2 little blocks and 1 big block at the end of the song which are the hooks which are being sung by a female artist.

In depth analysis of the song “Shoota”

Self-Similarity Matrices for “Shoota”


Here we are looking at a self-similarity matrix of the song “Shoota”, which is one of the centre points in the first graph when we are looking at valence and energy. This song in my opinion accurately represents a shift in the sound and style of hiphop, reflecting the evolution of the genre and the changing tastes and preferences of its audience. The delivery style and flow of the rapping are very different, it has a more melodic and sing-song approach in comparison with a more straight forward style of old school hip-hop. Shoota also features a somewhat more complex and layered production style than the old school hip hop songs and therefore I think it is an interesting song to look at and give a in depth analysis using multiple grams.

What is a Cepstogram?

What is the key of this song?


We can see that this song is in the key of C# minor, as it is dominantly present throughout the whole song, at the part where the drums come in and the other artist begins rapping. We see some more blending of keys, but still; the C# minor still contains the most energy.

Tempo of the track


Conclusion

Column

Conclusion/discussion